feat: wire TreeSitterChunker into LibScopeLite.index() via preChunked by RobertLD · Pull Request #461 · RobertLD/libscope

RobertLD · 2026-03-19T21:22:43Z

Summary

Add preChunked?: string[] to IndexDocumentInput — when provided, indexDocument uses these chunks directly, bypassing the markdown chunker
LibScopeLite.index() now checks doc.language: if set and supported by TreeSitterChunker, pre-chunks the content at function/class boundaries and passes result as preChunked; falls back silently to the text chunker on any error
Update LiteDoc docs (lite.md, lite-api.md) to mark setting language as the preferred approach over using TreeSitterChunker directly
7 new tests across lite.test.ts and indexing.test.ts

Test plan

npm run typecheck — no new errors
npm test — 1488 tests pass (7 new)
index() with supported language → chunk() called, results passed as preChunked
index() with unsupported/no language → text chunker used, no exception
index() when tree-sitter throws → falls back silently, indexing succeeds
indexDocument with preChunked → chunks stored verbatim in DB
indexDocument with empty/undefined preChunked → normal text chunking

🤖 Generated with Claude Code

Add `preChunked?: string[]` to `IndexDocumentInput` — when provided, `indexDocument` skips the markdown chunker and uses the caller's chunks directly. `LibScopeLite.index()` now checks `doc.language`: if set and supported, it pre-chunks the content with `TreeSitterChunker` and passes the result as `preChunked`. Falls back silently to the text chunker on any error (tree-sitter not installed, parse failure, etc.). Consumers set `language: "cpp"` (or any supported alias) on their `LiteDoc` and get function/class-boundary chunks automatically. Docs updated to note this as the preferred approach over using `TreeSitterChunker` directly. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

vercel · 2026-03-19T21:22:49Z

The latest updates on your projects. Learn more about Vercel for GitHub.

1 Skipped Deployment

Project	Deployment	Actions	Updated (UTC)
libscope	Ignored	Preview	Mar 19, 2026 9:25pm

sonarqubecloud · 2026-03-19T21:23:30Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

* fix: export TreeSitterChunker and CodeChunk from libscope/lite TreeSitterChunker was compiled but not re-exported from the ./lite entry point, making it inaccessible to consumers using the package exports map. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com> * feat: wire TreeSitterChunker into LibScopeLite.index() via preChunked (#461) Add `preChunked?: string[]` to `IndexDocumentInput` — when provided, `indexDocument` skips the markdown chunker and uses the caller's chunks directly. `LibScopeLite.index()` now checks `doc.language`: if set and supported, it pre-chunks the content with `TreeSitterChunker` and passes the result as `preChunked`. Falls back silently to the text chunker on any error (tree-sitter not installed, parse failure, etc.). Consumers set `language: "cpp"` (or any supported alias) on their `LiteDoc` and get function/class-boundary chunks automatically. Docs updated to note this as the preferred approach over using `TreeSitterChunker` directly. Co-authored-by: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com> * style: fix prettier formatting Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>

RobertLD merged commit e14cfe6 into fix/export-treesitter-chunker Mar 19, 2026
2 checks passed

RobertLD deleted the feat/treesitter-preChunked-lite-index branch March 19, 2026 21:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: wire TreeSitterChunker into LibScopeLite.index() via preChunked#461

feat: wire TreeSitterChunker into LibScopeLite.index() via preChunked#461
RobertLD merged 1 commit intofix/export-treesitter-chunkerfrom
feat/treesitter-preChunked-lite-index

RobertLD commented Mar 19, 2026

Uh oh!

vercel Bot commented Mar 19, 2026 •

edited

Loading

Uh oh!

Uh oh!

sonarqubecloud Bot commented Mar 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

RobertLD commented Mar 19, 2026

Summary

Test plan

Uh oh!

vercel Bot commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

sonarqubecloud Bot commented Mar 19, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vercel Bot commented Mar 19, 2026 •

edited

Loading